ExTaSem! Extending, Taxonomizing and Semantifying Domain Terminologies
نویسندگان
چکیده
We introduce EXTASEM!, a novel approach for the automatic learning of lexical taxonomies from domain terminologies. First, we exploit a very large semantic network to collect thousands of in-domain textual definitions. Second, we extract (hyponym, hypernym) pairs from each definition with a CRF-based algorithm trained on manually-validated data. Finally, we introduce a graph induction procedure which constructs a full-fledged taxonomy where each edge is weighted according to its domain pertinence. EXTASEM! achieves state-of-the-art results in the following taxonomy evaluation experiments: (1) Hypernym discovery, (2) Reconstructing gold standard taxonomies, and (3) Taxonomy quality according to structural measures. We release weighted taxonomies for six domains for the use and scrutiny of the community.
منابع مشابه
Towards an infrastructure for semantic applications: Methodologies for semantic integration of heterogeneous resources
As with many domains, information retrieval and knowledge management (IR/KM) in agriculture suffers from the problems of semantic heterogeneity, making it difficult for providers to disseminate their services effectively and for users to retrieve the information they need. Based on the analysis of resources in the domain of agriculture, this paper proposes a) application profiles for dealing wi...
متن کاملConstructing and Evaluating Controlled Bilingual Terminologies
This paper presents the construction and evaluation of Japanese and English controlled bilingual terminologies that are particularly intended for controlled authoring and machine translation with special reference to the Japanese municipal domain. Our terminologies are constructed by extracting terms from municipal website texts, and the term variations are controlled by defining preferred and ...
متن کاملOntology Merging Using Belief Revision and Defeasible Logic Programming
We combine argumentation, belief revision and description logic ontologies for extending the δ-ontologies framework to show how to merge two ontologies in which the union of the strict terminologies could lead to inconsistency. To solve this problem, we revisit a procedure presented by Falappa et al. in which part of the offending terminologies are turned defeasible by using a kernel revision o...
متن کاملIn silico structural analysis of quorum sensing genes in Vibrio fischeri
Quorum sensing controls the luminescence of Vibrio fischeri through the transcriptional activator LuxR and the specific autoinducer signal produced by luxI. Amino acid sequences of these two genes were analyzed using bioinformatics tools. LuxI consists of 193 amino acids and appears to contain five α-helices and six ß-sheets when analyzed by SSpro8. LuxI belongs to the autoinducer synthetase fa...
متن کاملHarmonizing and extending standards from a domain-specific and bottom-up approach: an example from development through use in clinical applications
OBJECTIVE Currently, the processes for harmonizing and extending standards by leveraging the knowledge within local documentation artifacts are not well described. We describe a collaborative project to develop common information models, terminology bindings, and term definitions based on nursing documentation systems, and carry the findings through to the adoption in standards development orga...
متن کامل